CDS

Accession Number TCMCG075C12840
gbkey CDS
Protein Id XP_017974528.1
Location complement(join(18870374..18870494,18870938..18871403,18871699..18871902,18871993..18872145,18872245..18872332,18872428..18872484))
Gene LOC18601801
GeneID 18601801
Organism Theobroma cacao

Protein

Length 362aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018119039.1
Definition PREDICTED: probable sodium-coupled neutral amino acid transporter 6 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category E
Description sodium-coupled neutral amino acid transporter
KEGG_TC 2.A.18.6.4,2.A.18.6.5
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko02000        [VIEW IN KEGG]
KEGG_ko ko:K14207        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04724        [VIEW IN KEGG]
ko04727        [VIEW IN KEGG]
ko04974        [VIEW IN KEGG]
map04724        [VIEW IN KEGG]
map04727        [VIEW IN KEGG]
map04974        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGAGCTTGCCTGCAACCATGAAAATTGTGGGGGTAGTTCCTGGAGTTGTCCTGATTTGCCTTGGAACCGTTGGAAAAGTTCTAGCTCAAGCTAGTATTATCATAAACAACATTGGCGGTCTGATTGTGTTTCTGATCATAATCGAGGATGTGCTATCAGGATCAACTTCCAGTGGGGTTCACCATGCTGGTATTTTGGAAGGCTGGTTTGGAGAACATTGGTGGACTAGTCGTGCCGTTGTTGTTCTAGTCCTAACAGCAGTTGTATTAGTTCCCTTTTTATGTTTCAAGCGCATTGATTCCTTAAGATTCACGTCTGCTATATCATTTGCATTGGCGGTTGTGTGTCTGGCTGTTGTTATTGGAATCACAATATACAAATTCATAATGGGGAGCATAGAGGCGCCTAAATATTTTCCTACTATTACCAATCTGTCATCCTTTTGGGAGCTCTTCACTGCTGTACCTGTTGTTATCTTTGCATACCTCTGCCACTATAATGTTCATCCAATTGCTAATGAGCTTGCTGACTCTCCTAGTATGCCAACAGTGGTGAAAACTTCAGTTGCTCTCTGCGCCATTGTGTATGTAATGACAGGCTTATTTGGGTTCTTCTTGTTTGGTGACTCCACTCTTTCTGACCTGTTGTCCAACTTCGACACTGATCTAGGCATACCATACAACTCCCTCTTCAATGATATTGTTCGAATCAGCTATGCAGGTCATATCATGCTTGTTTTCCCCATTATTTTCTTCCCTCTGCGCCTCAATGTGGATGGCCTCCTTTTTCCCTCAGCTGCACCTTTGTCTTCAGACAACTTAAGGTTTGGACTGGTCACTGTTGGGCTCATTGCCATTATTCTGCTAGGTGCAATATTCATTCCGAGCATATGGGTAGCGTTTGAGTTCACTGGAGCAACTGTTGGAGCTTTACTTGCCTTCATCTTTCCAGCCTGTATTACTCTCAAGGACCCTCATGGTATAGCAACGAAGAAGGATAAGATTTTATCCGTGTTCATGATCATTGTTGCAGTATTCTCAACTGTGGCAGCCATATACAGTGATGCATACTCTTTGTTAACAGCATGA
Protein:  
MSLPATMKIVGVVPGVVLICLGTVGKVLAQASIIINNIGGLIVFLIIIEDVLSGSTSSGVHHAGILEGWFGEHWWTSRAVVVLVLTAVVLVPFLCFKRIDSLRFTSAISFALAVVCLAVVIGITIYKFIMGSIEAPKYFPTITNLSSFWELFTAVPVVIFAYLCHYNVHPIANELADSPSMPTVVKTSVALCAIVYVMTGLFGFFLFGDSTLSDLLSNFDTDLGIPYNSLFNDIVRISYAGHIMLVFPIIFFPLRLNVDGLLFPSAAPLSSDNLRFGLVTVGLIAIILLGAIFIPSIWVAFEFTGATVGALLAFIFPACITLKDPHGIATKKDKILSVFMIIVAVFSTVAAIYSDAYSLLTA